Indoor/Outdoor Audio Classification Using Foreground Speech Segmentation

Authors

Banriskhem K. Khonglah

K. T. Deepak

S. R. Mahadeva Prasanna

Abstract

This work addresses indoor/outdoor audio classification using foreground speech segmentation. Foreground speech segmentation uses signal features to separate foreground speech from background interfering sources such as noise. First, the foreground and background segments are obtained using the normalized autocorrelation peak strength (NAPS) of the zero frequency filtered signal (ZFFS) as a feature. The background segments are then used to determine whether a particular segment comes from an indoor or an outdoor audio sample. Mel frequency cepstral coefficients (MFCCs) are extracted from the background segments of both the indoor and outdoor audio samples and used to train a Support Vector Machine (SVM) classifier. The use of foreground speech segmentation gives promising performance on the indoor/outdoor audio classification task.
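The segmentation feature named in the abstract can be illustrated with a minimal NumPy sketch. The abstract gives no implementation details, so everything here is an assumption: the zero frequency filter follows the commonly described form (difference the signal, pass it through two cascaded zero-frequency resonators, then subtract a local mean a few times), and NAPS is taken to be the largest normalized autocorrelation value within a plausible pitch-lag range. The function names, window lengths, number of trend-removal passes, and pitch range are all hypothetical choices, not the authors' settings.

```python
import numpy as np


def zero_frequency_filter(signal, sample_rate, mean_window_ms=10.0, trend_passes=3):
    """Return an approximate zero frequency filtered signal (ZFFS).

    Assumed parameters: a 10 ms local-mean window and 3 trend-removal passes.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Difference the signal to suppress any constant offset.
    y = np.diff(signal, prepend=signal[0])
    # Two cascaded resonators y[n] = x[n] + 2*y[n-1] - y[n-2] place four
    # poles at z = 1, which is equivalent to four cumulative sums.
    for _ in range(4):
        y = np.cumsum(y)
    # Remove the polynomial trend by repeatedly subtracting a local mean.
    half = int(round(mean_window_ms * 1e-3 * sample_rate / 2))
    kernel = np.ones(2 * half + 1) / (2 * half + 1)
    for _ in range(trend_passes):
        y = y - np.convolve(y, kernel, mode="same")
    # Samples near the edges saw a zero-padded window; blank them out.
    edge = trend_passes * half
    if edge > 0:
        y[:edge] = 0.0
        y[-edge:] = 0.0
    return y


def naps(frame, sample_rate, min_f0=60.0, max_f0=400.0):
    """Largest normalized autocorrelation value over an assumed pitch-lag range."""
    frame = frame - frame.mean()
    lo = max(1, int(sample_rate / max_f0))
    hi = min(int(sample_rate / min_f0), len(frame) - 2)
    best = 0.0
    for lag in range(lo, hi + 1):
        a, b = frame[:-lag], frame[lag:]
        denom = np.sqrt(np.dot(a, a) * np.dot(b, b))
        if denom > 0:
            best = max(best, float(np.dot(a, b) / denom))
    return best


def frame_naps(signal, sample_rate, frame_ms=40.0, hop_ms=10.0):
    """Compute one NAPS value per frame of the ZFFS (frame sizes are assumptions)."""
    zffs = zero_frequency_filter(signal, sample_rate)
    frame_len = int(frame_ms * 1e-3 * sample_rate)
    hop = int(hop_ms * 1e-3 * sample_rate)
    starts = range(0, len(zffs) - frame_len + 1, hop)
    return np.array([naps(zffs[s:s + frame_len], sample_rate) for s in starts])


if __name__ == "__main__":
    # Synthetic stand-in for voiced speech: a 100 Hz impulse train at 8 kHz.
    sample_rate = 8000
    voiced_like = np.zeros(4000)
    voiced_like[::80] = 1.0
    print(frame_naps(voiced_like, sample_rate))
```

In a pipeline like the one the abstract describes, frames with a high NAPS would be labelled foreground speech and the rest background, using a threshold that would have to be tuned on data. MFCC extraction from the background frames and SVM training would follow, and are not shown here.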

Similar resources

Moving object segmentation by background subtraction and temporal analysis

In this paper, we address the problem of moving object segmentation using background subtraction. Solving this problem is very important for many applications: visual surveillance in both outdoor and indoor environments, traffic control, behavior detection during sport activities, and so on. All these applications require, as a first step, the detection of moving objects in the observed scene...

Fusing Complementary Operators to Enhance Foreground/Background Segmentation

Foreground/background segmentation is an active research area for moving object analysis. We combine two probabilistic approaches, one of which estimates foreground/background probabilistic density while the other uses prior knowledge to decompose the colour space. The observed performance advantages are associated with the fusion of operators with completely different bases. Tests on outdoor and ...

On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks

A video's soundtrack is usually highly correlated to its content. Hence, audio-based techniques have recently emerged as a means for video concept detection complementary to visual analysis. Most state-of-the-art approaches rely on manual definition of predefined sound concepts such as "engine sounds", "outdoor/indoor sounds". These approaches come with three major drawbacks: manual definitions...

Enhanced foreground segmentation and tracking combining Bayesian background, shadow and foreground modeling

In this paper we present a foreground segmentation and tracking system for monocular static camera sequences and indoor scenarios that achieves correct foreground detection even in complicated scenes where the foreground and background colours are similar. The workflow of the system is based on three main steps: An initial foreground detection performs a simple segmentation vi...

Robust Foreground Detection in Videos Using Adaptive Color Histogram Thresholding and Shadow Removal

Fundamental to advanced video processing such as object tracking, gait recognition and video indexing is the issue of robust background and foreground segmentation. Several methods have been explored for this application, but they are either time or memory consuming or not very efficient in segmentation. This paper proposes an accurate and fast foreground detection technique for object track...

Publication date: 2017